Semi-automatic Discovery of Mappings Between Heterogeneous Data Warehouse Dimensions

Authors

  • Sonia Bergamaschi
  • Marius Octavian Olaru
  • Serena Sorrentino
  • Maurizio Vincini
Abstract

Data Warehousing is the main Business Intelligence instrument for the analysis of large amounts of data. It permits the extraction of relevant information for decision-making processes inside organizations. Given the great diffusion of Data Warehouses, there is an increasing need to integrate information coming from independent Data Warehouses or from independently developed data marts in the same Data Warehouse. In this paper, we provide a method for the semi-automatic discovery of common topological properties of dimensions that can be used to automatically map elements of different dimensions in heterogeneous Data Warehouses. The method uses techniques from the Data Integration research area and combines topological properties of dimensions in a multidimensional model.

Index Terms: Data Warehouse, P2P OLAP, dimension integration

Similar resources

On the Use of Dimension Properties in Heterogeneous Data Warehouse Integration

A new trend in Business Intelligence is the process of combining information from two or more different and heterogeneous Data Warehouses. Existing solutions rely mostly on the Extract-Transform-Load (ETL) approach, a costly and laborious process. The process of Data Warehouse integration can be greatly simplified by developing methods to semi-automatically discover semantic mappings among attr...

A Semi Automatic Tool For Schema Mapping

neric mapping framework at the schema level to address the problem of schema interoperability Providing a formalism for developing a generic, extensible, and semi-automated mapping A semi-automatic tool for schema mapping. at the University of Washington in Seattle, where he founded the database group. on Clio, the first semi-automatic tool for heterogeneous schema mapping. Keywords: data integ...

An a Priori Approach for Automatic Integration of Heterogeneous and Autonomous Databases

Data integration is the process that gives users access to multiple data sources through queries against a global schema. Semantic heterogeneity has been identified as the most important and toughest problem when integrating various data sources. Several approaches were proposed to deal with this problem. These approaches can be classified using three criteria: (1) data representation which mean...

Ontology-Based Conceptual Design of ETL Processes for Both Structured and Semi-Structured Data

One of the main tasks in the early stages of a Data Warehouse project is the identification of the appropriate transformations and the specification of inter-schema mappings from the data sources to the Data Warehouse. In this paper, we propose an ontology-based approach to facilitate the conceptual design of the back stage of a Data Warehouse. A graph-based representation is used as a conceptu...

Lightweight information integration through partial mapping and query reformulation

The growing amount of structured information becoming available, fostered by the advent and development of e.g. the Semantic Web and the Web 2.0 approaches, raises the need for (semi-)automatic, flexible and adaptable integration solutions. The effort invested into this partially manually created content can be leveraged by re-use and integration, so that additional communities of users can tak...

Publication date: 2011